A Study of the Use and Evaluation of Con dence Measures in Automatic Speech Recognition

نویسنده

  • Gethin Williams
چکیده

Con dence measures have been found to be useful for a number tasks within the eld of Automatic Speech Recognition (ASR). For example, the use of con dence measures has been reported in the utterance veri cation, keyword spotting and Out-of-Vocabulary (OOV) word spotting literature. In this report, it is shown that so called 'hybrid Arti cial Neural Network/Hidden Markov Model' (HMM/ANN) systems are well suited to the task of generating con dence measures, due to their ability to provide local phone class posterior probability estimates which may be used to generate con dence measures in a computationally e cient manner. A number of evaluation metrics are also described and the performance of ve con dence measures derived from the ABBOT hybrid HMM/ANN system for the tasks of utterance veri cation and OOV word spotting are evaluated using these metrics. Besides the tasks described above, con dence measures may also be used for tasks such as ltering the acoustics for a number of conditions prior to decoding, guiding the search over the space of alternative decodings and also as diagnostic tools. The use of con dence measures as diagnostic tools can indicate how components of an ASR system can be improved. For example, con dence measures can be used to identify poor pronunciation models and also to guide the search for improved models, where the generation of accurate pronunciation models remains an important challenge in the creation of practical recognition systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word-based confidence measures as a guide for stack search in speech recognition

The Maximum a posteriori hypothesis is treated as the decoded truth in speech recognition. However, since the word recognition accuracy is not 100%, it is desirable to have an independent con dence measure on how good the maximum a posteriori hypothesis is relative to the spoken truth for some applications. E orts are in progress [1, 2, 3] to develop such con dence measures with the intent of a...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Automatic speech recognition using acoustic confidence conditioned language models

A modi ed decoding algorithm for automatic speech recognition (ASR) will be described which facilitates a closer coupling between the acoustic and language modeling components of a speech recognition system. This closer coupling is obtained by extracting word level measures of acoustic con dence during decoding, and making coded representations of these con dence measures available to the ASR n...

متن کامل

Improved estimation, evaluation and applications of confidence measures for speech recognition

This paper describes our approach to the estimation of con dence in the words generated by a speech recognition system. We describe the models and the features employed for con dence estimation. In addition we discuss the characteristics of an information-theoretic metric for assessing the performance of the con dence measure. We provide a simple application of con dence measures in which we ra...

متن کامل

Détection de mots clés dans un flux de parole

The automatic speech recognition currently arouses a great interest. In particular, the keyword detection which constitutes a signi cant branch of the human-machine interaction and which can help us to communicate with our machines in a natural and direct way by using spontaneous speech. This technique consists in detecting in a pronounced sentence, the keywords characterizing the application a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998